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Notes on the Missing Satellites Problem 

James. S. Bullock (UC Irvine) 



Abstract 

The Missing Satellites Problem (MSP) broadly refers to the overabundance 
of predicted Cold Dark Matter (CDM) subhalos compared to satellite galax- 
ies known to exist in the Local Group. The most popular interpretation of 
the MSP is that the smallest dark matter halos in the universe are extremely 
inefficient at forming stars. The question from that standpoint is to identify 
the feedback source that makes small halos dark and to identify any obvious 
mass scale where the truncation in the efficiency of galaxy formation occurs. 

Among the most exciting developments in near-field cosmology in recent 
years is the discovery of a new population satellite galaxies orbiting the Milky 
Way and M31. Wide field, resolved star surveys have more than doubled 
the dwarf satellite count in less than a decade, revealing a population of 
ultrafaint galaxies that are less luminous that some star clusters. For the 
first time, there are empirical reasons to believe that there really are more 
than 100 missing satellite galaxies in the Local Group, lurking just beyond 
our ability to detect them, or simply inhabiting a region of the sky that has 
yet to have been surveyed. 

Remarkably, both kinematic studies and completeness-correction studies 
seem to point to a characteristic potential well depth for satellite subha- 
los that is quite close to the mass scale where photoionization and atomic 
cooling should limit galaxy formation. Among the more pressing problems 
associated with this interpretation is to understand the selection biases that 
limit our ability to detect the lowest mass galaxies. The least massive satel- 
lite halos are likely to host stealth galaxies with very-low surface brightness 
and this may be an important limitation in the hunt for low-mass fossils 
from the epoch of reionization. 
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1.1 Perspective and Historical Context 

These lecture notes focus on a potential problem for the standard model of 
structure formation, but before delving into this issue in depth, it is worth 
providing a bit of perspective. Cosmological observations ranging from the 
cosmic microwave background (Komatsu et al. 2010) to large-scale galaxy 
clustering (Reid et al. 2010) to the Lyman-a forest (Viel et al. 2008) all 
point to a concordance cosmological model built around Dark Energy + Cold 
Dark Matter (ACDM). Specifically, direct constraints on the power spectrum 
of density fluctuations suggest that structure formation in the universe is 
hierarchical (Press & Schechter 1974; White & Rees 1978), at least down 
to the scale of large galaxies. This is a remarkable fact and, for those of 
us interested in galaxy formation, it provides some confidence that we are 
not wasting our time attempting to build galaxies within the scaffolding 
provided by a CDM-based cosmology (Blumenthal et al. 1984; Davis et al. 
1985). On still smaller scales, stellar streams and clumps appear to fill the 
stellar halos of the Milky Way and M31 (Ivezic et al. 2000; Newberg et 
al. 2002; Ferguson et al. 2002; Majewski et al. 2003; Zucker et al. 2004; 
Guhathakurta et al. 2006; Ibata et al. 2007; Bell et al. 2008; Watkins et 
al. 2009; McConnachie et al. 2009) in accordance with expectations from 
ACDM-based models of dwarf galaxy accretion and disruption (Bullock, 
Kravtsov & Weinberg 2001; Bullock & Johnston 2005; Cooper et al. 2010). 
Even if the underlying makeup of the universe is somehow different than 
this, it must look very much like ACDM from the scale of the horizon down 
to ~ 50 kpc or so. It is within this context that we consider the Missing 
Satellites Problem in CDM. 

A defining characteristic of CDM-based hierarchical structure formation 
is that dark matter halos are assembled via mergers with smaller systems. 
In the standard framework of CDM, density inhomogeneities in the early 
universe are small compared to unity and characterized by a Gaussian dis- 
tribution with variance a. If we consider density fluctuations within spheres 
of radius R and associated mass M cc R 3 , we can characterize the linear 
fluctuation amplitude as 

a(M, a) oc D(a) F{M) (1.1) 

where a = (1 + z) _1 is the expansion factor and D(a) quantifies the gravita- 
tional amplification of fluctuations, which grow as D(a) oc a at early times 
when the universe is near critical density. CDM predicts that F(M) bends 
as F oc M _a with a = 2/3 on large scales to a — > (logarithmically) on the 
smallest mass scales (Blumenthal et al. 1984). At any given redshift, the 
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typical mass-scale that is collapsing, M*(a), can be estimated by determining 
the non-linear scale via a(M*,a) ~ 1, which implies M*(z) ~ D(a) l l a . This 
means low-mass scale fluctuations eventually break off from the expansion 
first. More massive objects are assembled later (at least in part) by many 
smaller-scale pre-collasped regions (Press & Schechter 1974). The normal- 
ization of the power spectrum is often quantified by a% = a(M = Ms, a = 1), 
whre Ms is the mass enclosed within a sphere of radius 8/i _1 Mpc. Structure 
of a given mass tends to collapse earlier for a higher value of as- 

One implication of this hierarchical scenario is that small dark matter 
halos collapse at high redshift, when the universe is very dense, and as a 
result have high density concentrations. When these halos merge into larger 
hosts, their high core densities allow them to resist the strong tidal forces 
that would otherwise act to destroy them. While gravitational interactions 
act to strip mass from merged progenitors, a significant fraction of these 
small halos survive as bound substructures (see, e.g. Zentner & Bullock 
2003). 

Kauffmann, White, & Guiderdoni (1993) were the first to show using a 
self-consistent treatment that substructure would likely survive the merging 
process with abundance in CDM halos. They used a semi-analytic model 
to show that subhalos within cluster-size dark matter halos will have ap- 
proximately the correct number and mass spectrum to account for cluster 
galaxies. On the other hand, the same calculation demonstrated that Milky- 
Way size halos should host a large number of satellite subhalos, with over 
~ 100 objects potentially massive enough to host observable satellite galax- 
ies (with L > W e L@), even when allowing for fairly substantial supernovae 
feedback. Given that there are only ~ 10 satellites brighter than this around 
the Milky Way, the authors concluded that dwarf galaxy formation would 
need to be suppressed by some large factor in order to explain the dis- 
crepancy. Nevertheless, this dwarf satellite issue did not gain significant 
attention in the community until these initial semi-analytic estimates were 
(more or less) verified by direct numerical calculations some ~ 5 years later 
(Klypin et al. 1999a; Moore et al. 1999). It was only then that the Missing 
Satellites Problem (MSP) gained urgency. 

Implicit in the title Missing Satellites is the notion of prediction - the 
satellites are expected by theory but are not seen. It is not surprising then 
that the problem was named by theorists. Kylpin et al. (1999a) titled their 
seminal paper on the topic "Where are the Missing Galactic Satellites?". 
The notion is that CDM is a successful theory and that the substructure 

f A strict empiricist may have called it "The Overabundant Substructure Problem" , but this is 
not what the MSP is now called even though many may interpret it this way. 



4 



prediction is robust and to be taken seriously. A similar sentiment was 
echoed in the nearly coeval paper by Moore et al. (1999): This is a robust 
prediction of CDM, so why don't we see more dwarfs? It is indeed the 
robustness of the Klypin et al. and Moore et al. calculations that demanded 
attention to the problem that was sketched by Kauffmann et al. (1993). 
More recent simulations, now with a factor of ~ 10 3 more particles, have 
effectively confirmed these first calculations (Diemand et al. 2008; Springel 
et al. 2008). The mass function of substructure is predicted to rise steeply 
to the smallest masses, while the luminosity function of observed dwarf 
satellites is fairly flat. In this set of lecture notes, I will touch on some of the 
physical mechanisms that may act to suppress galaxy formation within the 
smallest halos, but will focus primarily on the best way compare predictions 
with observations. Specifically, the zeroth order concern for any ACDM (or 
variant) model that wishes to match the Local Group satellite population 
is to correctly reproduce the mass-luminosity relationship obeyed by faint 
galaxies. This necessarily relies on mass determinations. Once we have 
robust mass determinations for the dwarfs, we can begin to test specific 
models for their formation and evolution. Section 1.4 provides an overview 
of recent efforts to constrain the mass-luminosity relation for dwarf galaxies. 

The most important observational development in MSP studies in the 
last decade has been the discovery of a new population of faint satellite 
galaxies. Approximately twenty- five new dwarf galaxy companions of the 
Milky Way and M31 have been discovered since 2004, more than doubling 
the known satellite population in the Local Group in five years (Willman et 
al. 2005; Zucker et al. 2006; Grillmair 2006, 2009; Majewski et al. 2007; 
Belokurov et al. 2007, 2009; Martin et al. 2009; see Willman 2010 for a 
recent review). The majority of these newly-discovered dwarfs are less lu- 
minous than any galaxy previously known. The most extreme of these, the 
ultrafaint Milky Way dwarfs, have luminosities smaller than an average 
globular cluster L ~ 10 2 — 10 4 L Q , and were discovered by searches for 
stellar overdensities in the wide-field maps of the Sloan Digital Sky Survey 
(SDSS) and the Sloan Extension for Galactic Understanding and Explo- 
ration (SEGUE). Follow-up kinematic observations showed that these tiny 
galaxies have surprisingly high stellar velocity dispersions for their lumi- 
nosities and sizes (a+ ~ 5kms _1 ; Martin et al. 2007; Simon &: Geha 2007; 
Simon et al. 2010) and subsequent mass modeling has shown that they are 
the most dark matter dominated galaxies known (Wolf et al. 2010; Martinez 
et al. 2010). Remarkably, these extreme systems are not only the faintest, 



f We will assume V-band luminosities throughout 
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most dark matter dominated galaxies in the universe but they are also the 
most metal poor stellar systems yet studied (Kirby et al. 2008; Geha et al. 
2009). All of the new dwarfs, with the exception of Leo T at a distance 
of more than 400 kpc, have negligible atomic hydrogen fractions (Grcevich 
& Putman 2009). Perhaps the most exciting aspect of these recent discov- 
eries is that completeness corrections point to a much larger population of 
undiscovered dwarfs (Koposov et al. 2008; Tollerud et al. 2008; Walsh et al. 
2009; Bullock et al. 2010). For the first time we have empirical evidence to 
suggest that there really are missing satellite galaxies in halo of the Milky 
Way and this is the subject of Section 1.5. These missing objects are likely 
extremely dark matter dominated, but too dim and/or too diffuse to have 
been discovered (yet). 



1.2 Counting and Defining Dark Matter Halos in CDM 

1.2.1 Galaxy Halos 

The standard paradigm of galaxy formation today posits that all (or at least 
an overwhelming majority) of galaxies reside near the centers of extended 
dark matter halos. The outer edges of the halos are somewhat arbitrarily 
defined, but a typical convention is that halos in the field are characterized 
by a radius i? n and mass M n that obey M n = Ah p m (4-7T /3) R^ , such that 
average density within the halo exceeds the background p m by a factor 
Ah ~ 200, which is motivated by the virialized overdensity obtained in 
approximate spherical collapse models (see, e.g. Bryan &: Norman 1998). 
With mass defined in this way, N-body simulations reveal that the halo 
mass function is remarkably well characterized by (see Tinker et al. 2008 
and references therein): 

. . p m diner' 1 
M a > T72 ji„ > ( L2 ) 



dM h JtiK ' Ml dlnM h ' 

where fh(o~) is effectively a fitting function that approaches a constant at low 
mass (Md ^ M# or a S> 1) and becomes exponentially suppressed for halos 
that are too large to have collapsed (M<j 3> M* or a <C 1)qJ One important 
observation (which has been recognized at least since White & Flees 1978) is 
that the mass function rises steadily towards smaller masses dn/dM ~ M -2 , 

f For our purposes the precise value of does not matter, but in principle one needs to be 
careful about the definition when doing comparisons, especially when considering halos more 
massive than Mj, ~ M*. 

\ Note that the linear mass variable M in a(M) has somewhat magically replaced by the virial 
mass variable M — > in this equation. It is not outrageous to think that the two should be 
related, but an exact correspondence would be bizarre given the simplicity of spherical collapse. 
The function /h(f) can be regarded as a fudge factor that makes up for this difference. 
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Fig. 1.1. Relationship between stellar mass M* and halo mass Mh as quantified 
by the abundance matching analysis of Behroozi et al. (2010), which includes 
uncertainties associated with observed stellar mass functions and possible scatter 
in the relationship between halo mass and stellar mass (shaded band). The lower 
dashed line is the extrapolated relationship that results from assuming the stellar 
mass function continues as a power law for M* < 10 8 ' 5 M©, using data from the 
full SDSS (Li & White 2009). The upper dashed line is extrapolated under the 
assumption that the slope of the stellar mass function has an upturn at low masses 
to a = —1.8, as found by Baldry, Glazebrook, & Driver (2008). 

while the luminosity function or baryonic mass function of galaxies in the 
real universe does not rise as steeply. Clearly a one-to-one correspondence 
between mass and luminosity must break down at small masses. 

A more precise way of characterizing the relationship between galaxies 
and their halos comes from the technique known as abundance matching. 
One simply asks the question: what mass-luminosity relationship must I 
impose in order to reproduce the observed luminosity function of galax- 
ies? Remarkably, two-point clustering statistics of bright galaxies can be 
explained fairly well under the simple assumption that galaxy luminosity L 
or stellar mass M* maps to dark matter halo mass m a nearly mono- 
tonic way (see Kravtsov et al. 2004; Conroy & Wechsler 2009; Moster et al. 
2010; Behroozi et al. 2010 and references therein). Figure [TTT1 illustrates the 
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required mapping by showing the ratio of galaxy stellar mass to halo virial 
mass (M^/Mh) as a function of halo mass (P. Behroozi, 2010, private com- 
munication). Dark gray shading indicates statistical and sample variance 
errors and the light gray shading includes systematic errors. The shaded 
band is truncated at small stellar masses where the mass functions become 
incomplete in the Sloan Digital Sky Survey (M* ~ 1O 8,5 M ). It is clear from 
this figure that the efficiency of galaxy formation must peak in halos with 
masses ~ 1O 12 M , and to become increasingly inefficient towards smaller 
and larger halo masses. An understanding of this behavior - how and why 
it happens - remains a fundamental goal in galaxy formation. 

The two dashed lines in Figure 1 bracket reasonable low-mass extrapola- 
tions of the relationship defined at larger masses, with the upper line defined 
by a power-law extrapolation of the observed stellar mass function from Li 
& White (2009) and the lower dashed corresponding to a case where the 
stellar mass function has an upturn, as advocated by Baldry et al. (2008). 
At small masses, the implied range of scalings is M* oc Mf t with p ~ 2.5 — 1.8. 
For example, at the edge of the figure, the power-law extrapolation suggests 
that Mh = 1O 9 M halos should host galaxies with stellar masses between 
M* ~ 5 x 1O 4 M and M* ~ 3 x 1O 5 M . As we shall see, halo masses 
of Mh ~ 1O 9 M (V max ~ 30km s _1 ) are approximately those required to 
match the kinematics of L ~ 1O 5 L dwarf galaxies (Strigari et al. 2008). Of 
course, there is no real physics in this extrapolation, but it is encouraging 
that when one imposes this simple, empirically motivated scaling, we get 
numbers that are not far off from those required to match dwarf satellite 
observations (Busha et al. 2010; Kravtsov 2010). A more general com- 
parison of abundance-matching masses and kinematically-derived masses is 
presented in Tollerud et al. (2010). 



1.2.2 Subhalos 

Galaxies that are as faint as L ~ 1O 5 L are very difficult to detect, and, 
as we discuss below (see Figure ll.lOp , we are only reasonably confident in 
our census of these objects to distances comparable to the expected virial 
radius of the Milky Way's halo, and within small, deeply surveyed regions 
within the virial radius of M31 (Majewski et al. 2007; Ibata et al. 2007; 
McConnachie et al. 2008; Martin et al. 2009). In addition, these galaxies 
appear to be embedded within dark matter halos (Mateo 1998; Simon & 
Geha 2007; Walker et al. 2009; Wolf et al. 2010; Kalirai et al. 2010; Collins 
et al. 2010). The implication is that in order to confront the faintest galaxies 
theoretically in the context of ACDM, we must discuss them as embedded 
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within subhalos - dark matter halos that are bound and within of a 
larger host halo. 

Subhalos (unlike field halos) are not naturally characterized by a virial 
mass, M n , because they tend to be truncated by tidal forces at radii smaller 
than their over-density-defined virial radii (e.g., Klypin et al. 1999b; Kazantzidis 
et al. 2004; Penarrubia et al. 2008). The most common way one charac- 
terizes a subhalo is to use V^nax, the maximum value of the circular velocity 
curve V c (r) = (GM{r) /r) 1 ' 2 , which itself is measured in spheres, stepping 
out from the subhalo center of mass. The radius r max where V^ax occurs is 
typically set by tidal stripping, which more readily unbinds material from the 
outer parts of galaxy halos than the inner parts (Bullock &: Johnston 2007; 
Kazantzidis et al. 2010). The choice of V^ax as a quantifier is particularly 
nice because it is fairly robust to the details of the halo finding algorithm 
used. Another choice is to assign subhalos a mass directly. Unfortunately, 
total mass assignment to subhalos is somewhat subjective and can result in 
differences from halo finder to halo finder, but reasonable choices involve 1) 
defining the subhalo boundary at the radius where its density profile approx- 
imately reaches the mean background density of the host halo; 2) estimating 
a local tidal radius iteratively; or 3) removing unbound particles from the 
subhalo iteratively in such a way that a boundary emerges fairly naturally 
(see, e.g. Springel et al. 2008 Figure 14 and related discussion). 

Figure 11.21 summarizes cumulative subhalo Vm ax functions from the three 
highest resolution ACDM simulations of Milky- Way size halos that have yet 
been run (VL II - Diemand et al. 2008; Aq-A - Springel et al. 2008; and 
GHALO - Stadel et al. 2009; Kuhlen et al. 2010). Each simulation contains 
N p ~ 10 9 particles within the halo radius i? n = i?2oc0 an d the subhalos 
plotted are those within i? n - Because the three halos have slightly different 
masses (M h = 1.3, 1.9, and 2.5 x 1O 12 M , for GHALO, VL II, and Aq-A, 
respectively) the V^ax values in Figure 11.21 have been scaled to Vh = V200 = 
(GM h /R h ) 1 / 2 for each system, where R h = 347, 402, 433 kpc and V h ~ 127, 
142, and 157kms _1 , for the three hosts. The shaded band summarizes the 
results of a series of lower resolution simulations (N p ~ 2 x 10 8 ) presented in 
Springel et al. (2008). The flattening seen in the cumulative Vmax functions 
are entirely a result of resolution limits, as can be seen by comparing the 
shaded band of lower resolution Aq halos to the dashed line, which is the 
highest resolution Aq halo. 

The most striking feature of the comparison shown in Figure 11.21 is that 
the simulation lines basically agree. The small normalization shift towards 



f defined via A = 200 over the background density 



Notes on the Missing Satellites Problem James. S. Bullock (UC Irvine) 9 




Fig. 1.2. Cumulative subhalo V m&K function within Rh — 400 kpc for the three 
highest resolution simulations of Milky- Way size halos from Kuhlen (2010, private 
communication), shown here relative to the circular velocity of each host at its 
outer radius Vh ~ 140 km s -1 . The solid red line shows VL II subhalos from Die- 
mand et al. (2008), the dashed black line shows Aq-A subhalos from Springel et al. 
(2008) and the solid blue line shows GHALO subhalos from Stadel et al. (2009). 
The shaded band provides an estimate of halo-to-halo scatter from a series of lower 
resolution halos (Aq suite, Springel et al. 2008). The small normalization difference 
between the GHALO/ VL II and Aq halos is likely a result of their having different 
power spectrum normalizations (Zentner & Bullock 2003). The most recent de- 
termination of the normalization WMAP-7 (Komatsu et al. 2010) is intermediate 
between the adopted normalizations of the simulations shown here. 



higher number counts for the Aq halos is likely due to the fact that the Aq 
runs use a higher normalization (<jg = 0.9) than the VL-II and GHALO runs 
(as = 0.74). The difference seen is in line with the analytic expectations of 
Zentner & Bullock (2003), who explored how changes to the power spectrum 
should affect substructure counts. Note that the most recent WMAP-7 
preferred value for the normalization is os = 0.81, which is intermediate 
between the two sets of simulations shown here. Overall, the V mSiX functions 
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are well-fit by a power law: 



( 



) 



-2.94 



A^(> V max ) ^ 0.15 



max 



(1.3) 



where the normalization has been chosen to be intermediate between the 
two sets of simulations shown. For a Milky- Way size host (Vh — 140 km s ; 
Xue et al. 2008; Gnedin et al. 2010) we expect N ~ 350 subhalos with 
Vmax > 10km s -1 within R<i ~ 400 kpc of the Milky Way. This number 
matches well with the predictions from the first MSP papers more than a 
decade ago (Klypin et al. 1999; Moore et al. 1999). What we also see is 
that the count is expected to rise for smaller halos, and there is no sign of a 
physical break. The highest resolution simulation shown in Figure 11.21 has 
already resolved more than 10 5 subhalos, and there is no reason to expect 
the numbers to stop rising in the absence of another scale in the problem. 

The only other obvious scale in CDM is the filtering mass in the power 
spectrum, where fluctuations are suppressed on the scale of the comoving 
horizon at the time of kinetic decoupling of the dark matter in the early 
universe (Loeb & Zaldarriaga 2005; Bertschinger 2006): 



The kinetic decoupling temperature depends on the scattering interac- 
tions of dark matter with standard model fermions, and is therefore sensi- 
tive to the microphysical dark matter model itself (Profumo et al. 2006). 
Martinez et al. (2009) showed that the popular Constrained Minimal Super- 
symmetric Standard Model (CMSSM) with neutralino dark matter allows 
M m i n ~ 10 -9 — 10 -6 M & , after accounting for the entire presently viable 
parameter space of the CMSSM, including low-energy observables, the relic 
abundance (p m ), and direct constraints on standard model quantities. The 
implication is that there is good reason to suspect that the mass spectrum 
of substructure in the Milky Way halo continues to rise down to the scale of 
an Earth mass, and even smaller. 

Figure 11.31 provides a cartoon-style estimate of the total substructure 
abundance within the Milky Way's halo down to the scale of the mini- 
mum mass. Plotted is the iV(> Vmax) relation from Equation 11.31 for a 
Milky-Way size host. The line becomes dashed where we are extrapolat- 
ing the power-law beyond the resolving power of state-of-the-art numeri- 
cal simulations. The upper axis provides a mass scale using bound-mass 
to Vmax relation for subhalos found by Springel et al. (2008): M su b = 
3.4 x lO 7 M0(V m ax/TOkms~ 1 ) 3 ' 5 . The vertical red band provides an indi- 




(1.4) 
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Fig. 1.3. Illustrative sketch of the expected cumulative CDM substructure abun- 
dance within the Milky Way's halo. The line is Equation 11.31 for a host halo of 
circular velocity V\ x = 140kms _1 at Rh = 400 kpc (for which we would expect a 
maximum circular velocity of about 200 kms -1 ). The line becomes dashed where 
we are extrapolating the power-law beyond the resolving power of state-of-the-art 
numerical simulations. The vertical red band provides an indication of where we ex- 
pect this power-law to break for popular CDM dark matter candidates. The lower 
horizontal dashed line shows the number of Milky Way satellite galaxies known 
while the upper dashed line is an estimate of the total number of satellite galaxies 
that exist within 400 kpc of the Milky Way corrected for luminosity-bias and sky 
coverage limitations of current surveys (Tollerud et al. 2008). The fact that the 
upper horizontal line intersects the edge of the vertical blue band at about the 
location of the CDM prediction is quite encouraging for the theory. 

cation of where we expect this power-law to break. Here, I have taken the 
M m { n range from Martinez et al. (2009) and expanded it slightly to allow 
for uncertainties associated with going form the linear power spectrum to 
subhalos, and to qualitatively allow for more freedom in the dark matter 
model (e.g. Profumo et al. 2006). For comparison, the magenta band shows 
the mass (or velocity) scale that has been directly resolved in simulations 
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and the vertical blue band on the far right indicates current range of Vmax 
values that are consistent with kinematic constraints from observed Milky 
Way satellite galaxies (as discussed below and in Peharrubia et al. 2008). 

Based on this extrapolated power-law and the range of minimum subhalo 
masses, we expect a total of ~ 10 11 to 10 17 bound subhalos to exist within 
400 kpc of the Sun! Compared to the total number of satellite galaxies known 
(~ 20; lower dashed line) or even an observational completeness corrected 
value (~ 400; upper dashed line from Tollerud et al. 2008), we are very far 
from discovering all of the subhalos that are expected in CDM. 

The most extreme way one might characterize the MSP is to say that 
CDM predicts > 10 11 subhalos while we observe only ~ 10 2 satellite galaxies. 
While this statement is true, it should not worry anyone because the vast 
majority of those CDM subhalos are less massive than the Sun. No one 
expects a satellite galaxy to exist within a dark matter halo that is less 
massive than a single star. We know that galaxy formation must truncate 
in dark matter structures smaller than some mass. The question is what is 
that mass? Is there a sharp threshold mass, below which galaxy formation 
ceases? Can we use the Local Group satellite population as a laboratory for 
galaxy formation on small scales? 

Physical processes like supernova feedback (Vmax ~ 100 km s -1 ), heat- 
ing from photoionization (Vmax ~ 30kms _1 ), or the ability for gas to cool 
(Vmax ~ 15kms _1 for Lyman-alpha and ~ 5kms _1 for H2) each imposes 
a different mass scale of relevance (Dekel & Silk 1986; Efstathiou 1992; 
Tegmark et al. 1997; Bullock et al. 2000; Ricotti et al. 2001). If, for ex- 
ample, we found evidence for very low-mass dwarf galaxies Vmax ~ 5kms _1 
then these would be excellent candidates for primordial H2 cooling 'fossils' 
of reionization in the halo (Madau et al. 2008; Ricotti 2010). We would like 
to measure the mass-scale of the satellites themselves and determine if the 
number counts of observed satellites are consistent with what is expected 
given their masses. Unfortunately, it turns out that this comparison is not 
as straightforward as one might naively expect. 

1.3 Defining the Problem 
1.3.1 Maximum Circular Velocity Vmax 
From the perspective of numerical simulations, a Vmax function such as that 
shown in Figure 11.21 provides a natural and robust way to characterize the 
substructure content of a dark matter halo. Ideally, we would like to put 

f A conservative requirement would be M\y > (1//j)Mq ~ 5Mq in order to host a "galaxy" 
containing one solar mass star 
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the satellite galaxies of the Milky Way and M31 on this plot in order to 
make a direct comparison between theory and data. Unfortunately, V ma _ x is 
not directly observable for the dwarf satellites. Instead, the stellar velocity 
dispersion, <r*, is the most commonly-derived kinematic tracer for dwarf 
spheroidal (dSph) satellite galaxies in the Local Group. dSph velocity 
dispersion profiles tend to be flat with radius (Walker et al. 2009) but for 
concreteness, I will use cr* to denote the luminosity-weighted line-of-sight 
velocity dispersion of the system. 

The first MSP papers estimated V max values for the dwarfs by assuming 
that their stellar velocity dispersion was equal to the dark matter veloc- 
ity dispersion within the central regions of their host halos: <t* = = 
Vm*jV2 (Moore et al. 1999) and a* = ~ V max /V3 (Klypin et al. 

1999). This was a reasonable start, especially given that the main point 
was to emphasize an order-of-magnitude discrepancy in the overall count 
of objects. Nevertheless, for detailed comparisons it is important to realize 
o"* 7^ c"dm- El Specifically, since the stars are bound to the central regions of 
dark matter halos, while the dark matter particles can orbit to much larger 
radii, we generally expect a+ < o"d m - This fact motivated the idea that the 
dwarf galaxies could be more massive than originally supposed, and per- 
haps populate only the most massive subhalos of the Milky Way (Stoehr et 
al. 2002; Hayashi et al. 2003; and Kravtsov et al. 2004b, who discussed 
a more nuanced model associated with mass prior to accretion). Below I 
demonstrate explicitly that the only relationship that can be made without 
imposing a theoretical model for subhalo structure is V max > V3o-± (Wolf et 
al. 2010). Cases where V max ~ V3a* correspond to those where the edge of 
the stellar distribution is very close to r max . Because r max is typically set by 
tidal truncation in CDM halos, this last equivalence is expected to hold in 
cases where the stars have be affected or are beginning to be affected by tidal 
stripping/stirring in the Milky Way (as shown numerically by Kazantzidis 
et al. 2010). 

One of the problems with placing dSph data on a Vmax function plot is 
that the mapping between o+ and V max depends sensitively on what one 
assumes about the density profile structure of the subhalos. The stellar 
velocity dispersion data probe only the potential within the stellar extent, 

X The best studied dwarfs have individual velocity measurements for more than a thousand stars 
(Walker et al. 2009) and this allows for more nuanced studies of their kinematics involving 
velocity dispersion profiles, higher-order moments of the velocity distribution, and explorations 
of the underlying distribution function shape. 

f This simple fact has caused a lot of confusion in the literature, especially when it relates to 
attempts to "measure" phase space densities of dark matter in dwarfs, which cannot be done 
without appealing to theoretical expectations for how and how far the dark matter halos extend 
beyond the stellar extent of the dwarfs. 
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and one must extrapolate beyond that point in order to estimate V max . For 
the current population of well-studied Milky Way dSph galaxies (see Table 
1 in Wolf et al. 2010), the median (3d) half-light radius is ri/ 2 — 300 pc, 
while rotation curves of ACDM subhalos with V maiX > 15kms _1 can peak at 
radii ranging from ~ 1000 — 6000 pc depending on the value of as (Zentner 
& Bullock 2003; Figure 21). For a typical dwarf, this amounts to a factor 
of ~ 3 to ~ 20 extrapolation in the assumed mass profile from the point 
where it is constrained by data (and it can be much worse for the smallest 
dwarfs). If we restrict ourselves to ACDM models with <7g = 0.8 we expect 
that median rotation curves will peak at radii 



1.35 



r - = 1100p Hl^J ' (LB) 

Even with fixed as, the extrapolations are large and sensitive to halo-to-halo 
scatter in profile parameters (the 68% scatter is about a factor of ~ 2 in r max 
at fixed Vmax according to Springel et al. 2008). For the most extreme cases 
of ultra-faint dwarf galaxies, the stellar extent is so small (r^ ~ 30 — 40 pc) 
that we must rely on a factor of ~ 35 extrapolation in r in order to reach a 
typical Vmax value at r max ~ 1100 pc. One begins to wonder whether there 
might be a better option than V max for comparing theory to data. 



1.3.2 Kinematic Mass Determinations 

What quantity is best constrained by stellar velocity dispersion data? Con- 
sider a spherically-symmetric galaxy that is in equilibrium with stellar den- 
sity distribution /?*(r) and radial velocity profile a r (r) that is embedded 
within total mass profile M(r). The Jeans equation is conveniently written 

as 

M(r) = ^( 7 , + 7CT -2/3), (1.6) 

where (3{r) = 1 — of /of characterizes the tangential velocity dispersion and 
7* = — dhi/c^/dlnr and 7 CT = — dlnof/dlnr. In principle, a r (r) can be 
inferred from the observed line-of-sight velocity dispersion profile a\ os (R), 
but this geometric deconvolution depends on the unknown function /3(r). 
At this point one may begin to get worried: uncertainties in f3 will affect 
both the mapping from observed cji os to a r and the derived relationship 
between M(r) and a r in Equation 11.61 Given that the dSph galaxies are 
well outside the regime where we can legitimately assume that mass-follows 



| Estimated from the results of Springel et al. (2008) and Diemand et al. (2008) by chosing an 
intermediate normalization 
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light, are we forced to assume that (3 = in order to derive any meaningful 
mass constraint? Thankfully, no. 

It turns out that there is a single radius where the degeneracy between 
stellar velocity dispersion anisotropy and mass is minimized. As shown 
analytically and numerically by Wolf et al. (2010), this radius is close to 
the 3d half light radius, for most galaxies, and it is the mass with 

this radius, M\/2 = M(< ri/ 2 ), that is best constrained by kinematic data. 
Figure 11.41 demonstrates this by showing explicitly mass model constraints 
for the Carina galaxy. By exploring models with variable (3 values, several 
authors had seen similar results prior to the Wolf et al. (2010) derivation, 
but with more restrictive conditions and without a full exploration of the 
implications (van der Marel 2000; Strigari et al. 2007a; Walker et al. 2009b). 
Indeed, rather than extrapolate mass profiles to estimate a V ma _ x value for 
each dwarf halo, Strigari et al. 2007b and 2008 advocated for the use of the 
integrated mass within a pre-defined radius that was similar in magnitude 
to the median ri/ 2 value for the dwarf population (either 600 pc for large 
dwarfs, or 300 pc for smaller dwarfs) as a means of comparing data to 
simulations. I will discuss these comparisons below, but first let me turn to 
a brief explanation of why the mass is best constrained within Txn- 

Qualitatively, one might expect that the degeneracy between the inte- 
grated mass and the assumed anisotropy parameter will be minimized at 
some intermediate radius within the stellar distribution. Think about the 
line-of-sight velocity dispersion measured along the projected center (R = 0) 
and then at the far edge R = r e d ge of a spherical, dispersion supported galaxy 
that has stars extending to a 3d radius of r e d ge . At the center, line-of-sight 
observations will project onto the radial component with o"i os ~ a r , while 
at the edge of the same galaxy, line-of-sight velocities project onto the tan- 
gential component with o"i os ~ at- Consider a galaxy that is intrinsically 
isotropic (/3 = 0). If this system is analyzed using line-of-sight velocities 
under the false assumption that a r > at (f3 > 0), then the total velocity 
dispersion at r ~ would be underestimated while the total velocity dis- 
persion at r ~ 7" ec ig e would be overestimated. Conversely, if one were to 
analyze the same galaxy under the assumption that a r < at (/3 < 0), then 
the total velocity dispersion would be overestimated near the center and 
underestimated near the galaxy edge. It is plausible then that there is some 
intermediate radius where attempting to infer the enclosed mass from only 
line-of-sight kinematics is minimally affected by the unknown value of (3. 

A more quantitative understanding of the r 1 / 2 mass constraint (but less 
rigorous than that provided in Wolf et al. 2010) can be gained by rewrit- 
ing the Jeans equation such that the j3{r) dependence is absorbed into the 



16 




0.0 0.2 0.4 0.6 0.8 0.0 0.2 0.4 0.6 0.8 

3D Physical Radius [kpc] 3D Physical Radius [kpc] 



Fig. 1.4. Left: The cumulative mass profile generated by analyzing the Carina dSph 
using four different constant velocity dispersion anisotropies. The lines represent 
the median cumulative mass value from the likelihood as a function of physical 
radius. Right: The cumulative mass profile of the same galaxy, where the black line 
represents the median mass from our full mass likelihood (which allows for a radi- 
ally varying anisotropy). The different shades represent the inner two confidence 
intervals (68% and 95%). The green dot-dashed line represents the contribution of 
mass from the stars, assuming a stellar V-band mass-to-light ratio of 3 Mq/Lq. 
This figure is from Wolf et al. (2010). 

definition of of ot = (3 - 2/3)of : 

GMiry- 1 = (r) + a 2 r (r) (7* + 7,-3). (1.7) 

It turns out that to very good approximation, the log slope of the stellar 
light profile at r]/ 2 for a wide range of commonly- used stellar profiles is close 
to —3, such that 7*(r 1 / 2 ) = 3 (Wolf et al. 2010). Under the assumption 
that the velocity dispersion profile is flat (implying 7 CT <C 3) we see that at 
r = r i/2 the mass depends only on <r tot : 

M(r 1/2 ) ~ G , -V t 2 ot (r 1/2 )r 1/2 ~3G- 1 a> 1/2 . (1.8) 

In the above chain of arguments I have used the fact that cr* = (<r t 2 ot ) and the 
approximation (o" t 2 ot ) ~ ( \ ot iTi/2j- This second approximation works because 
the stellar-weighted velocity dispersion gets its primary contribution from 
the radius where 7* = 3 (Wolf et al. 2010). A cleaner way to write this 
mass estimator is 



K(ri/ 2 ) = VSa* ■ 



(1.9) 
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Fig. 1.5. Observed circular velocities V c (r 1 / 2 ) plotted versus r 1 / 2 for each of the 
Milky Way dSph galaxies discussed in Wolf et al. (2010). The circular velocity 
curve values for the data were determined using Eauation ll.9l For reference, we plot 
V(r) rotation curves for NFW subhalos obeying a median V max — r max relationship 
given by Equation 11.51 Each curve is labeled by its V max value (assumed to be in 
kms -1 ). Dwarf galaxy points are color-coded by their luminosities (see legend). 
Notice that the least luminous dwarfs (red) seem to fall along similar V(r) curves 
as the most luminous dwarfs (green). 

Since by definition Vm ax > V c (r 1 / 2 ) we can conclude that 

V max > V3a*. (1.10) 

As mentioned above, a popular assumption in the literature has been to 
assume that dwarf galaxy subhalos obey V max = VSa* (Klypin et al. 1999; 
Bullock et al. 2000; Simon & Geha 2007). It is likely that this common as- 
sumption under-estimated the Vmax values of MW and M31 satellite galaxies 
significantly. Figure [131 plots V c (r]/ 2 ) vs. r 1 / 2 for the sample of Milky Way 
dwarf galaxies discussed in Wolf et al. (2010). For comparison, the lines 
show V(r) rotation curves for NFW subhalos (Navarro et al. 1997) obeying 
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Fig. 1.6. Mass function for Af 300 = M(< 300pc) for MW dSph satellites and dark 
subhalos in the Via Lactea II simulation within a radius of 400 kpc. The short- 
dashed curve is the subhalo mass function from the simulation. The solid curve is 
the median of the observed satellite mass function. The error bars on the observed 
mass function represent the upper and lower limits on the number of configurations 
that occur with a 98% of the time (from Wolf et al., in preparation). Note that the 
mismatch is about ~ 1 order of magnitude at M300 ~ 10 7 Mq, and that it grows 
significantly towards lower masses. 



the V max — r max relationship given by Equation 11.51 Each curve is labeled 
by its Vmax value (assumed to be in kms -1 ). In many cases, the rotation 
curves continue to rise well beyond the r 1 j 2 value associated with each point. 
Note that all of these galaxies are consistent with sitting in halos larger than 
Vmax = 10 kms , but that in many cases the implied extrapolation is sig- 
nificant. 



1.3.3 Mass within a Fixed Radius: Mqqq and M300 

Strigari et al. (2007b) suggested that a more direct way to compare satellite 
galaxy kinematics to predicted subhalo counts was to consider their inte- 
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grated masses within a radius that is as close as possible to the median 
stellar radius for the galaxy population. By focusing on the mass within 
a particular (inner) radius, one is less reliant on cosmology-dependent as- 
sumptions about how the dark matter halo density profiles are extrapolated 
to Knax- This is particularly important if one is interested in simultaneously 
testing CDM models with variable as or if one is interested in more extreme 
variant models (e.g. dark matter from decays, Peter & Benson 2010; or warm 
dark matter, Polisensky & Ricotti 2010). At the time when Strigari et al. 
(2007b) made their suggestion, the smallest radius that was well resolved for 
subhalos in numerical simulations was r ~ 600 pc (Via Lactea I; Diemand et 
al. 2007). This motivated a comparison using Mqqq = M(r < 600 pc), which 
is reasonably well constrained for the 9 'classical' (pre-SDSS) dSph satellites 
of the Milky Way. They found that all but one of the classical Milky Way 
dSphs (Sextans) had an Mqoq mass consistent with (3—5) x 1O 7 M0. By com- 
paring to results from the Via Lactea I simulation, Strigari et al. (2007b) 
concluded that the Milky Way dwarf masses were indicative of those ex- 
pected if only the most massive halos (prior to accretion) were able to form 
stars. 

An updated version of this comparison (from Wolf et al., in preparation) 
is shown in Figure 11.61 where now we focus on integrated masses within 300 
pc. This shift towards a smaller characteristic radius is motivated by an 
advance in simulation resolution, which now enables masses to be measured 
fairly accurately within 300 pc of subhalo centers. It is fortuitous that 300 
pc is also close to the median r\/2 for Milky Way dSph galaxies with well- 
studied kinematic samples (Wolf et al. 2010; Bullock et al. 2010). The 
short dashed line in Figure 11.61 shows the M300 mass function from the Via 
Lactea II simulation (Diemand et al. 2008), while the solid line shows M300 
for the known Milky Way dSph galaxies. The masses are again indicative of 
a situation where only the most massive subhalos host galaxies. Even at the 
point where the dSph mass function peaks (M300 — 10 7 M ) the simulation 
over predicts the count by about a factor of 10. This order of magnitude 
mismatch between observed counts and predicted counts at fixed mass is a 
reasonably conservative statement of the MSP circa 2010. 

The fact that the M300 mass function of Milky Way satellites peaks sharply 
(within a factor of ~ 4) at a mass of M300 — 10 Mq is remarkable given that 
these galaxies span a factor of ~ 10 5 in luminosity. This point was high- 
lighted by Strigari et al. (2008), who found that the M300 mass-luminosity 
relation for observed dwarfs is remarkably flat, with M300 oc £°- 03 ± - 03 . The 
associated plot from Strigari et al. (2008) is shown in Figure 11.71 The im- 
portant point to take away from this plot is that there is no detectable trend 
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Fig. 1.7. Mass within 300 pc vs. V-band luminosity for classical (pre-SDSS) Milky 
Way dwarf satellites (blue) and ultra- faint (post-SDSS) satellites (red). The most 
important point to take away from this plot is that there is no obvious trend between 
mass and luminosity. The trend seems to demonstrate scatter, but the masses do 
not appear to vary systematically with luminosity. 



between galaxy mass and luminosity. Galaxies with Ly ~ 4000 Lq (like Ursa 
Major II) demonstrate median M300 masses that are very close to those of 
galaxies at Ly — 10 7 Lq (like Fornax). Note that the Milky Way dwarfs 
do not all share exactly the same mass. For example, Sextans is clearly less 
massive than Carina, Leo II, and Draco. The mass of Hercules has been 
revised downward since the time that the plot in Figure 11.71 was published 
(Aden et al. 2009) but this revision does not change the fact that there is 
a very weak relationship between mass and luminosity for the Milky Way 
dwarfs. 

Segue 1 is arguably the most interesting case. With a luminosity of 
just Ly ~ 340 Lq, the mass density of Segue 1 within its half-light ra- 
dius (ri/2 — 38 pc) is the highest of any Local Group dwarf: pi/ 2 — 1.6M Q 
pc -3 (Martinez et al. 2010). As can be seen by examining Figure [L5l (red 
point, second from the left), a density this high at ~ 40 pc can only be 
achieved within a subhalo that is quite massive, with V m3iX > lOkrns -1 or 
with M300 ^5 x 1O 6 M . Though the placement of Segue 1 on the Stri- 
gari plot requires significant extrapolation (from 40 pc to 300 pc), such an 
extrapolation is not unwarranted within the ACDM context because sub- 
halos that are this dense and this massive almost always extend to tidal 
radii larger than 300 pc (as can be inferred, for example, from Equation 
ll.5p . With a luminosity some five orders of magnitude smaller than that of 
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Fig. 1.8. Relationship between the mass within 300 pc and Vmax for subhalos in the 
Via Lactea II simulation (points from Diemand et al. 2008) along with the analytic 
relationship expected for NFW halos that obey equation II .51 (red solid line). 



Fornax, Segue 1 is consistent with inhabiting a dark matter subhalo with 
approximately the same potential well depth. 

Given that the M300 mass variable is adopted for practical (not physical) 
reasons, it is worth examining its relationship to the more familiar measure 
Knax- The relationship between the two variables is illustrated in Figure 
11.81 The points are Via Lactea II subhalos (kindly provided by the public 
release of Diemand et al. 2008) and the solid line is the analytic estimate 
for NFW subhalos that obey the V max — r max relation given by Equation 
11.51 For 60kms _1 > V max > 15kms _1 , the mass within 300 pc correlates 
linearly with V max as M300 — 10 7 M0(14nax/12.5kms _1 ). For smaller halos, 
the relationship asymptotes to M300 oc V^ ax . The point to take away from 
Figure [TTHI is that the M300 variable is at least as sensitive to potential well 
depth as the more familiar variable V^n ax . 
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1.3.4 Making Sense of the Luminosity-Mass Relation 

How does the observed relationship between luminosity and mass compare to 
simple empirical expectations that have been gained from examining more 
massive halos? The symbols in Figure 11.91 show inferred Vmax values for 
Milky Way dwarfs plotted as a function of luminosity. I have opted to use 
Vmax as the scaling variable here (rather than M300) in order to make direct 
contact with more traditional scaling relations. The error bars on each 
point are typically larger (in a relative sense) than they are in Figure 11.71 
because the extrapolation to V mBX is more uncertain than the extrapolation 
to M300 in most cases. Nevertheless, the same global trend holds: there is a 
very weak correlation between luminosity and Vmax for the Milky Way dSph 
population. Note that I have extended the luminosity axis slightly in order 
to include Vmax estimates for the LMC and SMC (from van den Bergh 2000, 
who discusses the signifiant but unquantified uncertainties on mass estimates 
for the LMC and SMC). The lower red dashed line is the extrapolated Tully 
Fisher relation for brighter spiral galaxies from Courteau et al. (2007) and 
the upper blue dashed line is the relation one obtains from extrapolating the 
abundance matching power-law discussed in reference to Figure 11.11 above 
(specifically the Vmax — L relation advocated by Busha et al. 2010). The fact 
that the naive extrapolation of the abundance matching power law provides 
a reasonable match at L ~ 10 4 L & is encouraging. Nevertheless, there are 
some surprising points of disagreement. 

First, the data clearly demonstrate a flatter trend with luminosity than 
either of these scaling relations. The disagreement with the Tully Fisher 
line is less of a concern, as there is no reason to suspect that spheroidal 
galaxies should obey the same scaling relations as spiral galaxies. Abun- 
dance matching, on the other hand, is a global measure of the Vm ax — L 
mapping that is required to produce the correct abundances of galaxies. Of 
particular interest in Figure [L9l is the mismatch between abundance match- 
ing and dwarf properties for the most luminous dwarfs. The most luminous 
dSph galaxies at L ~ 10 7 L Q prefer V max — 20kms _1 , while the abundance 
matching expectation at this luminosity is much higher Vnax — 50kms _1 . 
Note that a completely independent analysis by Strigari et al. (2010) find 
similarly low Vm ax values for the luminous dSph galaxies. Of course, one 
possible explanation is that the subhalos that host these objects have lost a 
significant amount of dark matter mass, but several studies have looked at 
this effect in more detail and found that mass loss cannot easily account for 

f The dSph satellites of the Milky Way also deviate from the Baryonic Tully Fisher relation 
(McGaugh & Wolf 2010) as would be expected for systems that contain no detectable gas 
(except for Leo T; Grcevich &: Putman 2009). 
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Fig. 1.9. The V max vs. V-band luminosity relation for the Milky Way satellite 
population, as inferred from assuming that dSph galaxies sit within NFW dark 
matter halos that obey the same scaling relations as do subhalos in ACDM N-body 
simulations. The lower red dashed line is the Tully Fisher relation from Courteau 
et al. (2007) extrapolated to low lumnosities and the upper blue dashed line is the 
relation one obtains from extrapolating the abundance matching power-law from 
Busha et al. (2010). 



this difference (Busha et al. 2010; Bullock et al. 2010). Indeed, account- 
ing for the fact that dSph satellite galaxies are subhalos simply lowers the 
normalization of the blue-dashed line in Figure [TTTJl by about 50 %, without 
changing the slope to any appreciable agree. The reason for this is that sig- 
nificant mass loss tends to have occurred only in systems that were accreted 
at early times. At earlier times, larger V m&yi values are required to produce 
the same stellar luminosity (e.g. Moster et al. 2010; Behroozi et al. 2010), 
and this compensates for the fact that V max tends to decrease with time 
once a subhalo is accreted. 

One possible explanation for the lack of trend between V max (or M300) and 
luminosity is that we are seeing evidence for a real scale in galaxy formation 
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at Vmax — 15kms 1 or M300 — 10 7 M Q , which is remarkably close to the 
scale imposed by photoionization suppression and the HI cooling limit (Li 
et al. 2009; Maccio et al. 2009; Okamoto et al. 2009; Stringer et al. 2010). 
This characteristic V max /Msoo scale is well above the scale that would be 
indicative of a halo that required H2 to cool: these are unlikely to be pre- 
reionization fossils of first light star formation (Madau et al. 2008; Ricotti 
2010). 

A second explanation, which seems less likely, is that we are seeing a scale 
in dark matter clustering, which just happens to be very close to the mass 
scale where natural astrophysical suppression should kick in. 

A third possibility, addressed in the next section, is that the lack of an 
observed trend between mass and luminosity is the product of selection bias: 
most ultrafaint galaxies do inhabit halos with M300 ^$ 1O 7 M , but they are 
too diffuse to have been discovered. If so, this implies that searches for the 
lowest mass fossil galaxies left over from reionization may be hindered by 
surface brightness limits (Bovill & Ricotti 2009). 

Finally, it is possible that the apparent flatness in the V max / M^qq _ lu- 
minosity relationship for Milky Way dwarfs is simply an accident of small 
statistics or an artifact of misinterpretation of the data. Fortunately, as 
we now discuss, there is good reason to believe that the overall count of 
dwarf galaxies will grow significantly over the next decade. These discoveries 
should enable larger statistical samples of dwarfs with kinematically-derived 
masses and a more stringent investigation of the trends that appear to be 
present in the data at the current time. 



1.4 Empirical Evidence for Missing Satellites 

The Sloan Digital Sky Survey (SDSS; Adelman-McCarthy et al. 2007) has 
revolutionized our understanding of the Milky Way's environment. In par- 
ticular, searches in SDSS data have more than doubled the number of known 
Milky Way satellite galaxies over the last several years (Willman et al. 2005; 
Zucker et al. 2006; Grillmair 2006, 2009; Majewski et al. 2007; Belokurov 
et al. 2007, 2009; Martin et al. 2009), revealing a population of galaxies 
that were otherwise too faint to have been discovered (with L < 10 5 L@). In 
addition to providing fainter detections, the homogeneous form of the SDSS 
has allowed for a much better understanding of the statistics of detection 
(Koposov et al. 2008; Walsh et al. 2009). The Milky Way satellite census 
is incomplete for at least two reasons: sky coverage and luminosity bias. A 
third source of incompleteness comes from the inability to detect and verify 
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dwarf galaxies with very low surface brightness. We turn to this last point 
at the end of this section. 

1.4-1 Sky Coverage 

Resolved star searches have covered approximately ~ 20% of the sky, so it is 
reasonable to expect that there is a factor of c s k y — 5 more satellites fainter 
than L = 10 5 L@ , bringing to the total count of Milky Way dwarf satellites 
to N ~ iVp r e-sDSS + c sky iV SDSS ~ 9 + 5 x 12 ~ 70. 

Note that the estimate c s k y ~ 5 assumes that there is no systematic angu- 
lar bias in the satellite distribution. If the satellite distribution is anisotropic 
on the sky, the value of c s k y could be significantly different from 5. If, for 
example, the SDSS happens to be viewing a particularly underdense region 
of the halo, then the correction for covering the whole sky could be quite 
large c s k y 3> 5. Tollerud et al. (2008) used subhalos in Via Lactea I simu- 
lation (Diemand et al. 2007) to estimate the variance in c s k y for SDSS-size 
pointings within the halo and found that c s k y can vary from just ~ 3.5 up 
to ~ 8.3 depending on the mock surveys orientation. In principle, then, 
one might expect as many as 110 satellites (or as few as 50) from the sky 
coverage correction alone. 

1.4-2 Luminosity Bias 

The second source of number count incompleteness, luminosity bias, is more 
difficult to quantify because it depends sensitively on the (unknown) radial 
distribution of all satellites. Koposov et al. (2008) and Walsh et al. (2009) 
both found that detection thresholds are mostly governed by the distance 
to the object and the object luminosity, as long as the surface density of the 
dwarf was brighter than about /i = 30 mag arcsec -2 . 

Tollerud et al. (2008) used the Koposov et al. (2008) results to show that 
the SDSS is approximately complete down to a fixed apparent luminosity: 

i? comp ^66kpc( I ^ I -) ' , (1.11) 

where i? C omp is a spherical completeness radius beyond which a dSph of 
a particular luminosity will go undetected. The implied relationship be- 
tween galaxy luminosity and corresponding heliocentric completeness radius 
is shown by the solid curve in Figure [T. 101 The horizontal dotted line in Fig- 
ure [TTTO] marks a generous estimate of i?h — 417 kpc for the Milky Way halo 
edge. We see that only satellites brighter than L ~ 10 5 L & are observable 
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Fig. 1.10. The completeness radius for dwarf satellites from Tollerud et al. (2008). 
The three rising lines show the helio-centric distance, fi comp , out to which dwarf 
satellites of a given absolute magnitude are complete within the SDSS DR5 survey. 
The solid is for the published detection limits in Koposov et al. (2008) and the 
other lines are described in Tollerud et al. (2008). The dotted black line at 417 kpc 
corresponds to generous characterization of the outer edge of the Milky Way halo. 
The data points are observed satellites of the Milky Way and the Local Group. The 
red circles are the SDSS-dctected satellites, the only galaxies for which the detection 
limits actually apply, although the detection limits nevertheless also delineate the 
detection zone for more distant Local Group galaxies (yellow diamonds). Purple 
squares indicate classical Milky Way satellites. The faintest object (red triangle) is 
Segue 1, which is outside the DR5 footprint. 

out to this radius. The fact that the faintest dwarf satellite galaxies known 
are more than 100 times fainter than this limit immediately suggests that 
there are many more faint satellite galaxies yet to be discovered. 

Once armed with a completeness radius - luminosity relation like that 
given in Equation II. 11\ one only needs to know the radial distribution of 
satellites N(< R) in order to estimate the total count of faint galaxies out to 
some pre-defined edge of the Milky Way (we follow Tollerud et al. and use 
Rh = 417 kpc). For example, if we knew that there were iV b s Milky Way 
dwafs within A L of some luminosity L Q bs then Equation 11.111 tells us that 
the census of these objects is complete to -Rcomp(-^obs)- We may estimate 



Notes on the Missing Satellites Problem James. S. Bullock (UC Irvine) 27 

3 4 5 6 7 8 9 

3 10 10 10 10 10 10 10 




-2 -4 -6 -8 -10 -12 -14 -16 



Fig. 1.11. Luminosity function of dSph galaxies within i?h = 417 kpc of the Sun 
as observed (lower), corrected for only SDSS sky coverage (middle), and with lu- 
minosity completeness corrections from Tollerud et al. (2008) included (upper). 
Note that the brightest, classical (pre-SDSS) satellites are uncorrected, while new 
satellites have the correction applied. The shaded error region corresponds to the 
98% spread over mock observation realizations within the Via Lactea I halo. 



the total number of galaxies between L b s and L obs + AL using 

N tot (L obs ) ~ c sky iV obs R ^ . (1.12) 

iV (< -Kcomp(-kobsJ) 

If we make the assumption that satellite galaxies are associated with subha- 
los in a one-to-one fashion, then iV(< R^)/N{< R) may be estimated from 
analyzing the radial distribution of ACDM subhalos. Tollerud et al. (2008) 
showed that the implied ratio N{< R^)/N(< R) is almost independent of 
how the subhalos are chosen. As an example, consider the correction implied 
for the iV b s = 2 known Milky Way dwarfs that have L obs ~ IOOOLq. For this 
luminosity, we are complete to i? C omp = 66 kpc. The subhalo distributions 
presented in Tollerud et al. (2008) obey A^(< 417kpc)/N(< 66kpc) = 5-10 
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Fig. 1.12. Maximum radius for detection of dSphs as estimated by Tollerud et al. 
(2008) shown as a function of galaxy absolute magnitude for DR5 (assumed limiting 
r-band magnitude of 22.2) compared to a single exposure of LSST (24.5), co-added 
full LSST lifetime exposures (27.5), DES or one exposure from PanSTARRS (both 
24), and the SkyMapper and associated Missing Satellites Survey (22.6). The data 
points arc SDSS and classical satellites, as well as Local Group field galaxies. 

for a wide range of subhalo population choices. This suggests a total count 
of L ~ 1000 L Q galaxies is N tot ~ 50 - 100. 

Figure IT . 11 1 presents a more exacting correction of this kind from Tollerud 
et al. (2008). The lower (red) curve shows the observed cumulative lumi- 
nosity function of Milky Way dSph galaxies. The middle (green) curve is 
corrected for SDSS sky coverage only and yields ~ 70 galaxies brighter than 
L = IOOOLq. The upper curve has been corrected for luminosity complete- 
ness (using Via Lactea I subhalos for N(jR)). The result is that we expect 
~ 400 galaxies brighter than 1000 L & within ~ 400 kpc of the Sun. The cor- 
rection becomes more significant for lower luminosity systems because the 
completeness radius is correspondingly smaller. Note that this luminosity 
bias correction would be even more significant if one took into account the 
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fact that the presence of a central disk will tend to deplete subhalos (and 
associated galaxies) in central regions of the Galactic halo (D'Onghia et al. 
2010). This effect will tend to increase the fraction of galaxy subhalos at 
large Galactocentric distances. 

Referring back to Figure [L3l and Equation [L3j we see that N ~ 400 satel- 
lites would be about the satellite count expected only for V max > 10km s -1 
halos. It is encouraging that this is close to the V m3iX limit inferred directly 
from stellar kinematics, as illustrated in Figure fl~9l Of course, as discussed 
in Tollerud et al. (2008) and Walsh et al. (2009), the overall luminosity 
bias correction is sensitive at the factor of ~ 2 level to the precise subset 
of subhalos that are used to determine the radial distribution of galaxies. 
Generally, the best estimates suggest that there are hundreds of Milky Way 
satellite galaxies lurking in the outer reaches of the Milky Way. 

There is real hope that these missing satellites will be detected as surveys 
like LSST, DES, PanSTARRS, and SkyMapper cover more sky and provide 
deeper maps of the Galactic environment (see Ivezic et al. 2008; The Dark 
Energy Survey Collaboration 2005; Kaiser et al. 2002; Keller et al. 2007). 
Figure 11.121 from Tollerud et al. (2008) provides a rough determination 
of the completeness radius for several planned surveys. According to this 
estimate, LSST will be able to detect objects as faint as L ~ 500 L out to 
the edge of the Milky Way halo. 



1.4-3 Surface Brightness Limits and Stealth Galaxies 

Importantly, the luminosity-distance detection limits discussed above only 
apply for systems with peak surface brightness obeying fj, < 30 mag arcsec 2 
(Koposov et al. 2008). Any satellite galaxy with a luminosity of L ~ 1000L© 
and a half-light radius r 1( / 2 larger than about 100 pc would have evaded 
detection with current star-count techniques regardless of its distance from 
the Sun. This phenomenon is illustrated in Figure 11.131 where I plot the 
2d (projected) half-light radius R e vs. L for Milky Way dSph galaxies. 
The solid line shows a constant peak (central) surface brightness of u = 
30 mag arcsec -2 . The tendency for many of the fainter dwarfs to line up 
near the surface brightness detection limit is suggestive. There is nothing 
ruling out the presence of a larger population of more extended systems that 
remain undetected because of their low surface brightness. I refer to these 
undetected diffuse galaxies as stealth galaxies. 

If a large number of stealth galaxies do exist, they are likely associated 
with low-mass dark matter subhalos. If so, this will make it difficult to detect 
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Fig. 1.13. Half-light radius R e vs. V-band luminosity for Milky Way dSph galaxies. 
Galaxies above the solid line, with surface brightness fainter than [i = 30 mag 
arcsec -2 are currently undetectable. 



the lowest mass halos and will introduce a systematic bias that avoids low 
M300 masses in the Strigari plot (Figure II .T[) . 

One can understand this expectation by considering a galaxy with velocity 
dispersion <r* and luminosity L embedded within a gravitationally-dominant 
dark matter halo described by a circular velocity curve that increases with 
radius as an approximate power law: V c (r) = V300 (r/300pc) a . Equation 
11.91 implies 

n /2 = 300pc(V3^/V3oo) 1/a • (1.13) 

For an NFW halo (Navarro et al. 1997) with scale radius r max 3> 600 pc we 
have a = 1/2 and 



n/2 « 



r ^T7^- (1-14) 

> ^300 / JW300 

Clearly, r\/2 increases as we decrease M300 at fixed <r*. One implication 
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Fig. 1.14. Illustration of possible selection bias in the mass-luminosity plane of 
Milky Way dwarfs. The data points with errors reproduce the Strigari et al. (2008) 
M300 masses presented in Figure 11.71 Galaxies within in the shaded region (below 
dashed line) will remain undetected (/x > 30 mag arcsec -2 ) if they have velocity 
dispersions > 4km s _1 ( ct* > 5kms _1 ). 



is that if a galaxy has a stellar surface density X* oc L/r^ 2 that is just 
large enough to be detected, another galaxy with identical L and <7* will be 
undetectable if it happens to reside within a slightly less massive halo. 

Figure [T. 141 from Bullock et al. (2010) provides a more explicit demonstra- 
tion of how surface brightness bias can effect the mass-luminosity relation 
of dwarf galaxies. The points show M300 vs. L for Milky Way dSph galax- 
ies, with masses from Strigari et al. (2008) and luminosities updated as in 
Wolf et al. (2010). The region below the shaded band is undetectable for 
dwarf galaxies with a* > 4kms~ 1 . The region below the dashed line is 
undetectable for dwarf galaxies with a* > 5kms _1 . 

This result suggests that surface brightness selection bias may play a role 
explaining the lack of observed correlation between luminosity and mass for 
Milky Way satellites. It also implies that searches for the lowest mass 'fossil' 
galaxies left over from reionization may be hindered by surface brightness 
limits. This latter point was made earlier by Bovill &: Ricotti (2009). Ac- 
cording to estimates in Bullock et al. (2010), potentially half of several 
hundred satellite galaxies that could be observable by surveys like LSST 
are stealth. A complete census of these objects will require deeper sky sur- 
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veys, 30m-class follow-up telescopes, and more refined methods to identify 
extended, self-bound groupings of stars in the halo. 



1.5 Summary 

Advances in simulation technology have solidified the decade-old expecta- 
tion that substructure should be abundant in and around CDM halos, with 
counts that rise steadily to the smallest masses. Properties of observed satel- 
lites in and around the Local Group provide an important means to test this 
prediction. 

The substructure issue has gained relevance over the years because it 
touches a range of important issues that span many subfields, including the 
microphysical nature dark matter, the role of feedback in galaxy formation, 
and star formation in the early universe. It is in fact difficult to state directly 
what it would take to solve the MSP without first associating with a specific 
subfield. From a galaxy formation standpoint, one important question is 
to identify the primary feedback sources that suppress galaxy formation in 
small halos. An associated goal is to identify any obvious mass scale where 
the truncation in the efficiency of galaxy formation occurs. 

It is particularly encouraging for ACDM theory that both direct kinematic 
constraints on the masses of Milky Way satellite galaxies and completeness 
correction estimates both point to about the same mass and velocity scale, 
Vmax ~ 15kms _1 or M300 — 1O 7 M0. The luminosity bias correction dis- 
cussed in association with Figure 11.111 suggests that the Milky Way halo 
hosts about 400 satellite galaxies with luminosities similar to the faintest 
dwarf galaxies known. As seen in either Figure 11.31 or Figure 11.21 a total 
count of 400 satellites is approximately what is expected at a minimum 
Knax threshold of 15kms _1 - the same V max scale evidenced in the kine- 
matic measurements shown in Figure 11.91 These numbers did not have to 
agree. Interestingly, they both point to a mass scale that is close to limit 
where cooling via atomic hydrogen is suppressed and where photoionization 
heating should prevent the accretion of fresh gas (see Dekel 2005). 

More puzzling is the overall lack of observed correlation between Milky 
Way satellite galaxy luminosities and their M300 masses or V max values (see 
Figures 11.71 and 1 1 . 9[) . Most of the models that have been constructed to 
confront the mass-luminosity over-predict V m &x values at for the brightest 
dwarfs and under-predict them for the faintest dwarfs (Li et al. 2009; Maccio 
et al. 2009; Okamoto et al. 2009; Busha et al. 2009; Bullock et al. 2010). 
These differences may reflect small-number statistics at the bright end, and 
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a potential observational bias associated with surface brightness at the faint 
end. 

Section 11.4,31 discussed the idea that there may be population of low- 
luminosity satellite galaxies orbiting within the halo of the Milky Way that 
are too diffuse to have been detected with current star-count surveys, de- 
spite the fact that they have luminosities similar to those of known ultrafaint 
dSphs. These stealth galaxies should preferentially inhabit the smallest dark 
matter subhalos that can host stars (Vmax ^5 15 kms -1 ). One implication 
is that selection bias may play a role explaining the lack of observed cor- 
relation between L and M300 for Milky Way satellite galaxies. This effect 
also implies that searches for the lowest mass 'fossil' galaxies left over from 
reionization may be hindered by surface brightness limits. A large number 
(more than one hundred) stealth galaxies may be orbiting within the Milky 
Way halo. These systems have stellar distributions too diffuse to have been 
easily discovered so far, but new surveys and new techniques are in the works 
that may very well reveal hundreds of these missing satellite galaxies within 
the next decade. 
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